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TELECOMMUNICATIONS RECEIVER 



The present invention relates to a telecommunications 
receiver employing a new direct sequence code division 
multiple access (DS-CDMA) architecture which allows the use 
5 of fast adaptive algorithms. 

Two adaptive algorithms are commonly in use, the LMS 
and RLS Algorithms^'^'^ and these are described in Appendix I. 

The least mean square (LMS) algorithm (and the closely 
related normalised least mean squares (NLMS) algorithm) is 

10 a stochastic gradient algorithm which has only one 
parameter, the step size pt . The LMS algorithm is 
computationally simple but its convergence rate is slow and 
highly dependent on the properties of the input signal, more 
specifically on the eigenvalue ratio of the autocorrelation 

15 matrix . ~ When 'many~element:s "of "the input signal "are "unknown, 
for example the channel in a mobile communications system, 
it is difficult to choose fx. The algorithm is numerically 
stable, but an inappropriate choice of can cause 

instability. In high noise conditions, the eigenvalue ratio 

2 0 of the autocorrelation matrix is low and this can help with 

convergence . 

The recursive least squares (RLS) algorithm is 
computationally much more complex, but has much faster 
convergence than the LMS algorithm. It has two parameters, 
25 the forgetting factor X and the initial diagonal matrix term 
6. The forgetting factor is set appropriate to the rate of 
change of the autocorrelation of the input signal. The 
diagonal term has little effect on the algorithm once 
converged, but does affect the size of internal variables 

3 0 within the algorithm during initial convergence. The RLS 

algorithm. is usually considered converged within a number of 
iterations equal to twice the filter length, which is 
generally much faster than the LMS algorithm. The RLS 
algorithm can become numerically unstable when the 
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autocorrelation matrix of the input signal is close to being 
a singular matrix. 

There are a few much less common adaptive filter 
algorithms and the use of these algorithms has been found 
5 desirable. The fast a-posteriori error sequential technique 
(FAEST)=-« algorithm and its stabilised version the SFAEST', 
which are also described in Appendix I, have a convergence 
rate close to the RLS algorithm but complexity close to the 
LMS algorithm. They do however impose an additional 
10 constraint: the input signal must have a shift invariant 
property. The shift invariant property simply means that 
the input signal must; be the same as the input signal on the 
previous iteration shifted on by one sample, with only one 
new sample. This property is not satisfied by the 
15 conventional architecture for a minimum mean square error 
(MMSE) receiver for a DS-CDMA system^. The numerical 

stability of the FAEST algorithms is not as well understood 
as for LMS and RLS, but in practice the SFAEST algorithm 
seems to remain stable for a sufficiently long period of 
20 time for the purpose proposed here. The Fast Newton 
algorithm (see Appendix I) is an algorithm which can 
simplify the calculation of any of the above adaptive filter 
algorithms if the input signal can be modelled as an 
autoregressive filter with order less than is assumed by the 
25 above filters. 

The conventional architecture for the uplink and 
downlink of a DS-CDMA system with an adaptive filter 
receiver is shown in Figure 1. m this architecture, the 
training of an adaptive FIR filter 1 of length N+P-l chips 
(N being the number of chips per data bit and P the total 
number of chips in the code) is done at the bit rate, using 
an adaption error found by the algorithm to be the 
difference between data from a particular user and a sampled 
estimate of the data from the output of the filter 1, i.e. 
the filter has an effective training path ETP. The contents 
of the filter 1 change completely from one iteration thereof 
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to the next. This means that convergence is slow and it is 
not possible to use the FAEST or SFAEST algorithm because 
the shift invariance property is not satisfied. with the 
LMS algorithm, convergence is too slow and the time taken to 
5 reconverge when a user switches on or off is far too slow 
This architecture does work reasonably well with the rls 
algorithm, although convergence is still not very rapid and 
the computational complexity is very high. 

It is one aim of the present invention to provide a 
10 DS-CDMA receiver using an adaptive filter in which the 
convergence is rapid. it is another aim of the invention to 
allow the use of the less common adaptive algorithms which 
has not hitherto been possible. 

. .^""^ P^^sent invention provides a direct sequence code 
15 division multiple access (DS-CDMA) rec_eiver comprising an 
adaptive filter for filtering data which "has been multiplied 
by a spreading code, the filter having a length equal to the 
length of a channel via which the data is transmitted to the 

20 ITT'^: " -^^i— detector operating on the out.ut 

20 Of the adaptive filter. 

The algorithm is preferably operable to recalculate 
and use data either from the spread-multiplied signal of a 
desired user only, or from a composite signal which is the 
sum Of the spread-multiplied signals of all transmitting 
25 users. This means that the adaptive filter will be trained 
by new information at the chip rate of the code. 

in a particular embodiment of the invention the fixed 
multiuser detector is of the minimum mean squared error 
(MMSE) type, but it may alternatively be of the zero forcing 
30 (decorrelating) , Volterra, Radial Basis function, 
cancellation, near optimum or other decoding types. 

The algorithm can for example comprise the least mean 
squares (LMS) or recursive least squares (RLS) algorithm. 
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However, because the adaptive filter of the invention 
satisfies the shift invariance property, the algorithm may 
alternatively comprise the fast a-posterion error sequential 
technique (FAEST) algorithm, the stabilised FAEST (SFAEST) 
5 algorithm, and the above algorithms or others may be used in 
combination with the Fast Newton algorithm. 

In order that the present invention may be more 
readily understood, reference will now be made, by way of 
example only, to the accompanying drawings, in which:- 

^0 Figure 1 shows the conventional DS-CDMA architecture 

already discussed; 

Figure 2 shows DS-CDMA architecture according to an 
embodiment of the invention; and 

Figures 3, 4 and 5 are graphs of simulation results, 
15 showing respectively the comparative convergence of the 
architectures of Figures 1 and 2, the relative convergence 
rates of different algorithms using the architecture of 
Figure 2, and the bit error ratio (BER) results for the 
architecture of Figure 2 . 

2 0 Figure 2 shows DS-CDMA ' transmitter and receiver 

architecture comprising spreading means 2 respectively 
operated by each user in which a data signal from the user 
is multiplied by one of a set of spreading codes uniquely 
allocated to the respective user. The data is supplied to 

2 5 the spreading means at its bit rate and the code is input at 

its chip rate, there being N code chips for each data bit. 

The spread-multiplied data is summed at 4 and filtered 
in a finite impulse response (FIR) channel filter 6 of 
length P chips, the output of which is transmitted, with 

3 0 incidental Gaussian noise, to a receiver comprising an 

adaptive FIR filter 8, also of length P chips. 
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An adaptive algorithm 10 controls and trains the 
adaptive filter 8. The algorithm 10 basically finds an 
error signal which is the difference between (a) either the 
spread-multiplied data from the desired user only or the 
5 composite spread-multiplied signal, and (b) the output of 
the adaptive filter 8. The choice of training data in (a) 
can either be preset or switchable. The effective training 
paths ETP for the adaptive filter, representing this 
operation of the algorithm are shown in Figure 2 . 

10 The adaptive filter 8 acts similarly to (but not 

exactly the same as) a conventional equaliser. As 
previously stated, training can be on the desired user's 
signal only, or on the composite chip rate signal. The 
filter 8 has less taps than the adaptive filter 1 in the 
15 conventional architecture and is trained at the chip rate, 

which ^means much-f aster-convergence-r - The adaptive-filter 8 

also satisfies the shift invariance property required for 
the FAEST and SFAEST algorithm and this allows LMS 
complexity levels with RLS convergence rates. A fixed 
20 multiuser detector 14 is precalculated and stored at the 
receiver and is based on knowledge obtained about the number 
of users in the system and their spreading codes. This 
detector 14 can be of the MMSE type^ (used in the comparisons 
here) or it can be of a different type, for example zero 
25 forcing or decorrelating^^- Voterra^^- Radial Basis function^^ 
cancellation based^^- or near optimum decoding based on 
Viterbi decoding^^ . 

Summary of Multiuser Detection Techniques 

The conventional single user detector or spreading 
3 0 code matched filter calculates the correlation between the 
received signal and the spreading code (or the spreading 
code convolved with the channel impulse response in a 
multipath channel) over the data bit period. The multiple 
access interference (MAI) impacts the ability to recover the 
3 5 desired transmissions with this receiver and is more 





- 6 - 

pronounced for high power interfering users. 

The matched filter is the optimum detector in AWGN but 
the MAI cross-correlation terms are not Gaussian unless 
there are many active users. In this situation the optimal 
5 detector is not the conventional matched filter but rather 
a form of multi-user detector. 

The detector that yields the most likely transmitted 
sequence maximizes the probability that it was transmitted. 
When all possible transmitted sequences are equally 

10 probable, this is the maximum likelihood sequence estimator 
(MLSE) - When implemented with a Viterbi algorithm the 
complexity is exponential in the number of users. The MLSE 
must also estimate the received amplitudes and phases but it 
lowers the mobile user transmitter power control accuracy 

15 requirement. Despite the performance and capacity gains 
over conventional detection, the MLSE is not a practical 
solution . 

Here the zero forcing or decorrelating detector 
implements an inverse of the received signal autocorrelation 

20 matrix (neglecting the noise) such that the output is 
completely free of multiple access interference. The 
decorrelating detector was initially proposed in the late 
1970s and it provides substantial perf ormance/capacity gains 
over the conventional detector and does not need to know in 

25 advance the received signal amplitudes, avoiding sensitivity 
to estimation error. It has computational complexity 
significantly lower than MLSE. Another desirable feature is 
that it corresponds to the. MLSE when the user energies are 
unknown. A disadvantage of this detector is that it causes 

3 0 noise enhancement i.e. the power associated with the noise 
term at the output of the decorrelating detector is always 
greater or equal to the noise term at the output of the 
conventional detector. A more significant disadvantage of 
the decorrelating detector is that the computations needed 

35 to invert the matrix are considerable. 
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A possibly superior linear receiver structure is to 
build an adaptive filter that minimises (at its output) the 
error power. This implements a partial or modified inverse 
of the autocorrelation matrix, dependent on the level of 
5 background noise, to balance the desire to decouple the 
users for MAI reduction while not enhancing the background 
noise. Again this receiver structure implements a matrix 
inversion operation and here we can apply the recursive 
adaptive filter techniques. This detector differs from the 

10 decorrelating detector in that it takes the noise terms into 
account to minimise the noise enhancement. Thus if noise is 
low the minimum mean squared error (MMSE) receiver 
approaches the decorrelating detector and achieves close to 
optimum performance. On the other hand, if the multiple 

15 access interference is small compared to. the noise, then the 
matched filter detector solution is approached. This is 
exactly analogous to the MMSE linear equalizer used to 
combat inter-symbol interference. Unlike the decorrelating 
detector, it requires estimation of the received amplitudes 

2 0 but its complexity is independent of number of active users 

and explicit knowledge of the CDMA spreading sequences is 
not required for an adaptive implementation. 

Other researchers have investigated radial basis 
25 function (which in its complete implementation is similar to 
MLSE, various simplifications have been suggested) and 
Volterra nonlinear approaches (where the received signal is 
passed through a power series nonlinear expansion before 
applying a linear filter eg. decorrelating or MMSE) . Note 

3 0 that if a MMSE error type fixed multiuser detector is used, 

the overall impulse response of the adaptive filter followed 
by the fixed detector is not exactly the same as for the 
conventional architecture even when both systems are fully 
converged. This will give different bit error ratio (BER) 
35 results even when both the adaptive filters in the two 
receivers are fully converged. Note also that the adaptive 
filter 8 has an extremely low signal to noise ratio at its 
output as the processing gain has not been applied at this 
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stage. This leads to a low eigenvalue spread which can help 
the convergence and stability of some adaptive algorithms 
but others exhibit instability in the extremely high noise 
conditions 

5 Possible multiuser detectors are discussed in more 

detail in Appendix II, 

The output from the detector 14 is down- sampled to the 
bit rate at the synchronous points to give the required 
estimate of the user's data signal. 

10 Simulation Results 

All simulation results presented below are for 16 chip 
spreading codes (i.e. P=16) and a six tap stationary 
channel. Firstly we show that the new architecture has far 
faster or convergence than the conventional one. Figure 3 

15 shows convergence curves for the conventional architecture 
of Figure 1 and the new architecture of Figure 2 . The new 
architecture is much faster than the conventional one, 
mostly because it is trained at the chip rate instead of the 
bit rate but also because the eigenvalue ratio of the 

20 autocorrelation is reduced in high noise. 

The graph of Figure 3 shows ensemble averaged squared 
error at the output of the adaptive filter in both 
architectures plotted against time measured in chips, for 
the single user case. Both curves use the LMS algorithm 

25 with the value of fi individually optimised for each. The 
use of chip rate training instead of bit rate training, 
increases the convergence by a factor of 16, but in fact the 
convergence is more than 16 times faster because high noise 
reduces the eigenvalue ratio in the autocorrelation- The 

3 0 new algorithm converges to a MMSE 16 times higher than the 
conventional architecture, but this loss will be largely 
regained through the fixed multiuser detector. 



Figure 4 shows the relative convergence rates of some 
different adaptive filter algorithms using the. architecture 
of Figure 2. The use of the SFAEST algorithm is only 
possible in the new architecture, as the conventional one 
does not satisfy the shift invariance property. 

Relatively speaking, the LMS algorithm is very slow. 
It can be made faster by increasing the value of pt but when 
this value is increased too far, the LMS algorithm does not 
converge to the MMSE error floor (misadjustment error) . The 
RLS is fast, but at the price of high computational com- 
plexity. The SFAEST algorithm is as quick as the RLS 
algorithm if it is initialised appropriately. 

Figure 5 shows BER results for the new architecture 
after allowing 160 chips (only 10 databits) for convergence 
-o-:^the-adaptive "filter ~ 

With only 16 0 databits for convergence, the LMS 
algorithm is not fully converged when the training period 
ends and this results in a slightly poorer performance than 
is obtainable with the conventional architecture. The RLS 
and SFAEST give similar results. 

The architecture of the receiver of the invention is 
much faster converging and tracking than the conventional 
architecture with only a small loss in BER performance for 
a stationary channel. In a time varying channel, average 
BER will be better for the new architecture because of 
reduced tracking errors. The fast adaptive algorithms 
(FAEST, SFAEST, preferably in combination with Fast Newton) 
allow the new architecture to achieve good convergence 
without the computational complexity of the RLS algorithm. 

The receiver of the invention could be integrated in 
a "hard-wired" form or could be made capable of being 
updated by using reconf igurable or replaceable firmware. 



APPENDIX I 



Summaries of Adaptive Algorithms 



Least mean square algorithm From the stochastic gradient approach [1] one obtains the well 
known least mean squares (LMS) algorithm which was first introduced in 1960 by Widrow and Hoff 
[2]. This algorithm is uncomplicated and yields acceptable performance in most cases. The LMS 
technique is probably the most frequently used adaptive algorithm in current communication system. 
It is derived by applying the method of steepest descent minimisation to the Wiener-Hopf equations 
which define the optimum Wiener filter and one obtains a simple recursive scheme of the form [3] 

f new I ^ r old I f learning 1 f new | 
\ tap weights J ~ \ tap weights J \ rate J \ information J 

where the, new. information consists of the product of the filter input vector and the error 
signal, i.e. the difference between the desired filter output and the actual output of the filter. 
Essentially one can express the LMS algorithm (and many other adaptive algorithms) by means of 
three expressions [1]: 

Filter output : y{i) = w(0^x(i) (2) 
Adaption error : e{i) = d{t) - y{t) (3) 
Tap weight update : w(t + 1) = w{t) + /ix(t)e(i) (4) 

where x is the filter input vector, the transposed tap weight vector, d represents the desired filter 
output and jj. is the learning rate (step size parameter). 

Several modifications have been made to improve the LMS algorithm, the most important one 
resulted in the normalised LMS (NLMS, [4]) which normalises the adaption error e using instantan- 
eous estimates of the input vector power ||x|l^ . Doing so largely improves the algorithms stability 
characteristics and allows faster convergence. The LMS algorithm is computationally simple but its 
convergence rate is slow and highly dependent on the properties of the input signal, more specifically 
on the eigenvalue ratio of the autocorrelation matrix. When many elements of the input signal are 
unknown, for example the channel in a mobile communications system, it is difficult to choose 
The algorithm is numerically stable, but an inappropriate choice of /x can cause instability. In high 
noise conditions, the eigenvalue ratio of the autocorrelation matrix is low and this can help with 
convergence. Other versions of the LMS such as leaky, signed and quantised LMS [3] were developed 
but only the NLMS achieved sufficient stability at acceptable performance and is therefore considered 
as a candidate for application in advanced multiuser detection. 

To illustrate the improved performance of the normalised LMS as compared to the standard LMS 
we will present the performance of both the LMS and NLMS later in this document report. 
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ix.ecursive least square algorithm The most popular least-square (LS) technique is is probably 
the recursive least squares (RLS) algorithm. The RLS algorithm is computationally much more 
complex, but has much faster convergence than the LMS algorithm. It has two parameters, the 
forgetting factor A and the initialisation factor 6 for the diagonal matrix P (0), see Equations below. 
The forgetting factor is set appropriate to the rate of change of the autocorrelation of the input 
signal. The diagonal term has little effect on the algorithm once converged, but does effect the size 
of internal variables within the algorithm during initial convergence. The RLS algorithm is usually 
considered converged within a number of iterations equal to twice the filter length, which is generally 
much faster than the LMS algorithm. The RLS algorithm can become numerically unstable when 
the autocorrelation matrix of the input signal is close to being a singular matrix. 
We can summarise the standard RLS algorithm as follows [1]: 



Initialisation 



P (0) = s-'^i 
w (0) = 0 

Subsequent iterations 



^^^^ ~ A + x2'(«)P(t- l)x(0 

e(t) = - w^(t- l)xiv(«) (6) 
w (f) = w (f - 1) + s (i) e {t) (7) 

P (f) = i (P - 1) - g it) it) Pit- 1)) (8) 

The vector g (<) is known as the gain vector of the algorithm, P (0 represents the inverse of 
the correlation matrix * it) = E!=i ^*"'x (*) x(t)^ which can be computed in a recursive manner by 
means of the matrix inversion lemma. The variable e denotes the a-priori error of the filter estimation 
and the filter weights are again represented by the vector w. 



Fast a-posteriori error sequential technique (FAEST) The fast a-posteriori error sequential 
technique (FAEST) has first been reported by Carayannis et al [5] in the context of least-square (LS) 
filtering. Compared to standard LS algorithms the FAEST uses a different approach for calculating 
the Kalman gain vector based on the a-posteriori error formulation rather than the a-priori error 
formulation as used in fast Kalman algorithms [6]. Assuming the shift invariance property of the input 
signal and introducing a slightly modified version of the Kalman gain, the FAEST algorithm manages 
to perform a direct updating of the Kalman gain without invoking matrix- vector multiplications. 
The Kalman gain update according to the FAEST algorithm is summarised in the Table 1. 



Table 1: The FAEST algorithm. 





Multiplications 


Divisions 


ev [t) = x{t) - a.pj [t - IJ Xjv {t - 1) 


/V 
I y 








1 

X 


a{v (i) = ^o^'h («-!) + W it) 


2 






2 


1 


w/v+i [t) - _ 1) - -r,.Mt-i) -a/v {t - 1) . 




1 


a,v (t) = Oiv (i - 1) - sir (t) w^v {t - 1) 


N 






2 






1 


1 


IN (t) = & it) IN+I it) 


1 








1 


it) = XaOj^ (i - 1) + e% it) s% (i) 


2 




'^N it) -b/V it - 1) 


N 




hN it) = biV - 1) - S%f it) W;v it) 


N 




Total number of multiplications and divisions 


5N + 11 


5 



It is well known that the FAEST algorithm suffers from sever stability problems and we will therefore 
not describe this algorithm in more detail but focus on its stabilised version, the SFAEST. For more 
information about FAEST please refer to [5]. 
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^abilised FAEST The stabilised version of the FAEST algorithm has been derived by [7] and is 
presented in Table 3. For easier reading of the equations building the stabilised FAEST (SFAEST) 
al<'orithm, we will first list all variables occurring and explain their meaning with a few words, see 
Table 2. 



Table 2: Variable definitions of the SFAEST algorithm. 



Variable Name 


Definition 


x{t) 


Filter input at time t 


y{t) 


Filter output, y{t) = OVt < 0 




Input vector . . . , x(i - iV + 1)] 




Dual Kalman gain vector 




Forward predictor coefficients 


b/v(t) 


Backward predictor coefficients 


hivW 


Filter coefficients 




A-priori filter error 




A-posteriori filter error 




Forward predictor error and its corrected version 




Backward predictor error and its corrected version 




Forward prediction error power 




Backward prediction error power 




0 < lN{t) < 1 








Difference of errors and the corrected error difference 


A 


exp. forgetting factor 0 < A < 1 



Table 3 shows the successive steps of the SFAEST algorithm and ^Iso evaluates the complexity 
in terms of multiplications and divisions required. 

Initialisation, stabilisation and optimisation issues related to the SFAEST The value of 
the forgetting factor A largely depends on the rate by which the input signal changes and defines 
the memory length of the algorithm. The eigenvalue spread of the weighted input covariance matrix 
given by 

R,v(0 = ^A^--Xiv,rX^,. (9) 

T=0 

plays an important role when determining A. A suitable value could for example be A = 0.98 but 
the type of the input signal needs to be considered and it is most likely that changing between static 
and dynamic users and static and dynamic channels will require different values for A. 
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Table 3: Tasks of SFAEST in order of computation and number of required MUL/DIV operation 



Tasks of SFAEST MUL 


DIV 


Available at time t: 

gN{t - 1), a,v(t - 1), biv(t - 1), hN{t - 1), X!vit - 1) 
7A^(t-l),a/v(t-l), atv(i-l) 






New Information: 

x{t),y{t) 






Computation of the residuals and corrections: 
A-priori forward/ backward errors (residuals): 

1) ei;{t) = x{i) - a,v(i - l)^Xiv(t - 1) 

_v h / m\ Ar\ L /a 1 A T* /J.^ 

2) = x(i - N)- bM{t - Ij^x^vlO 


N 

A/" 


- 


Normalisation parameters: 

4) 5;v(i) = 1 + 7iv+i We5vW(gi;?(< - 1) + - 1)"") 

5) 7.v(0 - ^ and 7iv(0 = a-g^O)x.(t) 

6) ifc,v(i-l) = A-^77v(f-l)aj:;(i-l) 


4 
3 

2 


1 

1 
1 


Difference of^rrors: 

7^ f\T(t) = e^lrCf) + kM(t — l')e{,f<') + XairU - l)gw(* - 1) 


3 


- 


Difference of errors of corrected filters 

^iVl^J - l+p(l-'7N(t))+pfc^v(<-l)(l-'7^^(<-l)) 


4 


1 


Corrected a-priori forward/ backward errors: 

9) ei,{t) = - (1 - lN{t - l))A:,v(t - l)pf7v(0 

10) a^^{t) = Xai,{t~l) + 'yN{t'l){ei,{t)y 

11) e*,(^) = e%{t) - (1 - 7NW)Kiv(0 

12) atv(0 = - 1) + 7N(0(^Ar W)' 


3 
3 
2 
3 


- 
- 


Time update of the duaJ Kalman gain giv(*): 
Extended Kalman gain vector: 

i^J giv+U^J g,v(i-l) 1 Aa^(t-i) -a,v(t-l). 


iV + 1 


1 


Forward filter: 

14) a,v(0 = aiv(t - 1) - 7iv{* - l)(e{,(0 + - l)pfiv(*))gN(^ - 1) 


iV + 4 





Dual Kalman gain: 

.cx r S^v(i) 1 „ /.X „iV+i/^\ r -biv(i- 1) 
1 0 1 ^ SA^+i(*) -Syv+i(^) 1 


iV 




Backward filter: 

16) bv(0 = h^it - 1) - A,vW(e^^(t) +p^,vW)g7vW 


iV + 3 




Time update of the filter hjs/{t): 

18) e^it) = y(^) - h^(Oxyv(0 

19) ej^fii) =7iv(0eiv(0 

20) h,v(0 = hiv(i - 1) - €N(Ogiv(0 


N 
1 




Total number of multiplications and divisions: 8iV + 36 






-15- 

The choice for the variable p is not straight forward and no direct rule for calculating an appro- 
priate value exists. In [7] it is mentioned that a initialisation can be done by means of an estimate 
siich as 



p = po r with po ^ 0.05. (10) 

1 — A 

This could however not be confirmed by simulations carried out in the course of this work. Usually 
a value of /? = 1 has been chosen but an adaptive estimation of p during operation of the algorithm 
might prove advantageous. The most crucial parameters were found to be the forward/ backward 
error powers. Initialisation has been done by means of the following rules: 



a^^(Q)^^X^ (11) 
aJv(0)=M (12) 

As can be seen both error powers depend on the value p. and hence this parameter needs to be 
initialised with care. A typical, stable range for p was found to be between IQ < fx < 100. From 
a stability point of view it is advantageous to monitor the evolution of the variable 7Ar(i)- To 
prevent divergence of the algorithm, this variable should be restricted to values between 0 and 1. A 
recommended rule [7] to assure convergence is to reinitialise the algorithm when the condition 

becomes true, where z; is a small constant. This precaution will also take care of the case where the 
gain tends towards zero, hence — 0. 
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Fast Newton transversal Siter The Fast Newton algorithm is an algorithm which can simplify 
the calculation of any of the above adaptive filter algorithms if the input signal can be modelled as 
an autoregressive filter with order less than is assumed by the above filters. Fast Newton transversal 
filters originate from the area of speech enhancement and echo cancellation. Their main feature is a 
fast calculation of the gain vector as required in many LS adaptive algorithms. 

The stabilised fast Newton transversal filter (SFNTF) is essentially a computational accelerator 
for any least square (LS) algorithm. Operating as a "higher-level" adaptive predictor it can use 
any LS algorithm as a subroutine within its own algorithm. However, the order of this LS filter 
can be chosen to be smaller than the actual filter order of the SFNTF algorithm, A sophisticated 
predictor part then extrapolates the remaining filter coefficients to gain the complete set of filter 
coefficients as required according to the definition of the SFNTF length. It is this feature that should 
make the SFNTF a potentially attractive adaptive algorithm for many application and the usefulness 
of SFNTFs for advanced MUD receivers shall be examined in the future. As far as this report is 
concerned, we will however restrict ourselves to the description of the algorithm only. 

To discuss the details of the SFNTF we will again list all variables as present in the algorithm 
with some words of explanation about the meaning of the variables. Table 4 lists these definitions. 



Table 4: Variable definitions of the SFNTF algorithm. 



Variable Name 


Definition 


x{t) 


Filter input at time t 


vit) 


Filter output. y{t) = OVt < 0 




Input vector [x(t), x(t - N + 1] 




Dual Kalman gain vector of order N 


qAr(^),qAr+i(0 


temporary vectors to compute gjv(^) in version 
2 and 3. 




temporary vectors to compute g;v(i)- 




Forward predictor coeflicients 




Backward predictor coefficients 




Filter coefficients 




Input sample covariance matrix 




Vector that builds RiV, see end of table. 


eNit) 


A-priori filter error 




A-posteriori filter error 




Forward predictor error and its corrected version 




Backward predictor error and its corrected ver- 
sion 




Forward prediction error power 




Backward prediction error powerr 


7iv (^),7iv+i {t) 


0 < 7iv(i) < 1 








Difference of errors and the corrected error dif- 
ference 


A 


exp. forgetting factor 0 < A < 1 



We can now describe the SFNTF with all its equations. 

For easier reading of the following equations, we first define two new vectors s and u 
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Sp+l(<) = T- 



e-pit) 



>^aUt-l) I -ap(«- 1) . 



u 



P+i 



Xa''p(t- 1) 



-bp(f-l) 
1 



t° = t- N + P 



(14) 



(15) 



Comparing these definitions with Equation 4, we notice the similarity with the updating part 
/ixe. While A represents the forgetting factor sinnilar to fj., the second factor of Equations 14 and 15 
are normalised error signals and the third factor, containing the vectors a and b, are similar to the 
input vector x of Equation 4. Note however that this is only a basic attempt to explain the nature of 
the vectors s and u and not a completely valid comparison - in particular with respect to the third 
factor. The vectors a and b represent the forward and backward predictor coefficients of the FNTF, 
respectively, and are computed by means of the sample covariance matrix Rp by 



ap(0 = Rp'(t-l) 



and 



(16) 



xi(i-P + l) _ 

Their corresponding predictor error powers can then be defined as 



(17) 



and 



a^(0 = x°(i) - [ xM«) ... x^it) ]ap(f) 



a^(f) = x°(i-P)- [ xi(i) ... xP{t-P + l)]hp{t) 



(18) 



(19) 



As we notice from the definitions above, the SFNTF operates two separate predictor branches 
and by enabling or disabling those branches we can create three different versions of the algorithm: 

Version 1: Using forward and backward predictors (not recommended) 

Available values at time i: giv(^ — 1) and 7;v,p(t — 1) 

From SFAEST algorithm : sp+i(<), e^(i), up+i(i°) and e^p{t°) 



0 



. giv(i 



sp+i(i) 
On-p 



On-p 
up+x(i°) J 



iNAt) = iN,p{t - 1) + s)=+i(o4w - up+i(i°)4(*°) 



(20) 

(21) 
(22) 
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Version 2: Using only forward predictors 



Available at time t: qiv(i — l). 5yv(* — 1) 

From SFAEST algorithm: sp+i{t), ej,{t), sp+i{t°), e^,(f°), gp(f°),7P(i°) 



qiv+i(f) = 



0 



. qiv(<-i) . 
qiv+i(<) - 



+ 



On-p 



On-p 

On-p 
[ sp{t°) J 

= 9N{t - 1) + s}>+i(t)e^(f) - s^+i(f°)e;^(t°) 
7iV,p(<) = 7P(*°) + gN{t) 



SN(t) = 



- qiv(0 



Version 3: Using only backward predictors 

Available at time t: qiv(* — ^)i9N{t — 1) 

From SFAEST algorithm: up+i(<), e^pit), up+i(f°), e^(<°), gp(t°),7P(i°) 



(23) 
(24) 

(25) 

(26) 
(27) 



qiv+i(t) 



L qiv(i - 



1) J 



+ 



[ ] = 



UP+l (<) 

0^-p 
Ojv-p 1 



SN{t) = 



[ up+l(*°) 

- qiv(i) 



q^v+i (*) - 

r spit) 
L Oiv_p J 

9N{t) = gN{t - 1) + u^X\{t)e''p{t) - u^+l(f°)e^(«°) 
7iV,p(0 = 7P(<) +5iv(*) 
Time update of the filter hiv(i): 

Available at time t: h/v(t — l),xjv(t — 1) 
New information: x{t),y{t) 



(28) 

(29) 

(30) 
(31) 



eN{t) = y{t) - h^(i - l)x^(t - 1) 
e;v(t) = 



(32) 
(33) 
(34) 



lN,p{i) 

hN{t) = hiv(f - 1) - €N{t)SN{t) 

Predictor version 1 is more likely to diverge due to the fact that the calculation of 7 is not realised 
as a finite sum. Versions 2 and 3 include only a finite sum of P + 1 values in the computation of 7. 

As the algorithm of Table 5 is rather complex and difficult to overview, we also include Table 6 
which displays the variable occurences in the different equations of Table 5 and states which variables 
are required for updating other variables. 

'Note: The definitions of 7 differ between FAEST and FNTF! 




Table o: Tasks of SFNTF with SFAEST in order of computation and number of required MUL/DIV 
operations. 



SFNTF with SFAEST 



Available at time t: 

gp{t - 1), a.p{t - 1), bp(^ - 1), h^.t_i, Kp{t - 1), 
7p(f-l),a^U-l),Q^p(i-l) 



New Information: 

x(t), y{t) 



MUL 



DIV 



Computation of the residuals and corrections: 
A-priori forward/backward errors (residuals) 

1) e/(t) =z(t)-ap(<-l)^xp(f-l) 

2) etp(f) = x{t -N)- hp{t - l)^xp(f) 



P 
P 



Normalisation parameters 

4) 5p(f) = 1 + 7P+1(04W(S?(< - 1) + I^Sl)^^^^ - ^^""^ 

5) 7PW = ^and7PW = Trijfc7(7)' 

6) kp{t - 1) = A-^7p(f - l)ag(f - 1) 



4 
3 



Difference of errors: 

7) ^p{t) = eOpjt) + kp{t - l)e^t) + Aa^(f - l)g^(f - 1) 



Difference of errors of corrected filters 

^) - l+p(l-yf(t))+pA;^p(t-l)(l-7p(t-l)) 



Corrected a-priori forward/ backward errors 

9) e^p{t) = e^,(f) -{l-ip{t-l))kp{t- 

10) o^(«) = \afp{t - 1) + 7f(< -_1)(4(0)'' 

11) ^p{t) = e'p{t)-{l-'rp{t))p^p{t) 

12) a*pW = Aa^pCf - 1) + 7P(0(^(0)' 



Time update of the dual Kalman gain gp(t): 
Extended Kalman gain vector: 



13) 





0 




1 




gp(< - 1) . 




-ap(f- 1) _ 



3 
3 
2 
3 



P + 1 



Forward filter: 
14) ap(0 = ap(f 



1) - ip{t - \){tUt) + kp{t-i)pipmzp[t-\) 



P + 4 



Dual Kalman gain: 

' SP{t) 
0 



15) 



~ „P+i 
= gP+i,t - gp+i.t 



-bp(f-l) 
1 



Backward filter: _ 

16) bp ffl = bp(t - 1) - iPit)ie''p{t)+p^p(t))gp{t) 



Computation of dual Kalman gain for SFNTF: 
17) see descriptio n of Version 1), 2) and 3) 



Time update of the SFAEST filter hp{t): (not required for SFNTF) 
18a) ep{t) = y{t)-h^{t)^p{t) 
19a) €p{t) =7p{t)ep{t) 

20a) hp (t) = hp(t - 1) - 6p (Qgp (t) 



P + 3 



P 
1 

P 



Time update of the SFNTF filter h^v,*: 

18b) eiv(0 = yit) - h^^_^x/v,f-i 

19b) €N{t) = ewit)/lNit) ^ 

20b) hiv.j = hjv,f-i - e,v(Ogjv,t 



iV 



Total number of multiplications and divisions: 



BP + 2N + 39 
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APF DIX II 

Multiuser detectors for CDMA 



THE MMSE RECEIVER 



Consider the direct sequence spread spectrum system shown in Fig. 1. 



Desired 



Data bits D 



Sprcadiog ccxie Cih 
N chips per data bit 



Interfering 
users 




Fig. I: Direct sequence CDMA system with no channel model 

In it we have M {m = I to M) users, each transmitting a FN code of length N chips (n = 1 to N). We 
assume that all the users are both data bit and chip synchronous and that the data modulation D^rt and the 
codes Crrm take the values ±1. The signal at the input to the receiver at chip n is therefore:- 



M 



y(n) = Z D„C^ + Win) 



{1} 



W(n) is the noise sample at chip n. Wiener filter theory [I], [2] states that the optimal set of weights for an 
FIR filter h^^pt is given by:- 

{2} 



0yy is the autocorrelation matrix of the input signal and for a stationary input:- 



{3} 



yin)y{n- 1) 
y(n)y{n-2) 



yHn) 

y(n)yin-\) 



yin)yin-N) 
y{n)y{n'N+i) 
yin)y(n - W + 2) 



yHn) 



,y{n)y{n'N) y{n)y{n~N-h\) 

Where E[x] denotes the expected value of x. We are interested in the FIR filter when it contains no data 
transitions, ie when it only has one data bit from each user in it. Assuming that the data modulation on 
each code is independent, ie £(D^D£,) = 0, A^B by direct substitution of equation { 1 } into equation {3 } or 
analogy with the spatial radar case [3], [4] it can be shown that 

0^-eAg^ + ^72/ {4} 



The matrix Q has dimension NxM and has the codes as its columns, ie:- 







2- 






C21 


C3, . 




C,2 


C22 








C23 


C33 . 






^2N 







2^ denotes the transpose of matrix Q. The matrix A has dimensions MxM and has at position m along 
its leading diagonal and is zero elsewhere. P„ is the power at the receiver of user m. For the rest of the 
section we shall assume = 1 for all m, ie perfect power control with the power of each user normalised 
to 1. The a^l term is the noise term assuming the noise power is a" and the noise is uncorrelated. 

The vector 0xy is given by:- 

yin) 



0^ = E xin) 



yin - 1) 
y(n'2) 



{5} 



x(n) is the desired response. We are only interested in the response when the FIR filter contains the chips 
from one data bit, ie when n = Q and at that point the desired response is the data bit for the desired code. 
Assuming die desired code is m = 1 then x{n) - Dj. Substituting this and equation {1} into equation {5} 
and assuming the the data symbols are uncorrelated yields:- 

Cu 

Ci3 



{6} 



'1^ 

Thus the vector 0^ is the desired code. From equations { 1 }. {3} and {6}, and under the assumptions that 
we have stated, the optimum mean squared error performance is only dependent on the code set chosen. 
The minimum mean squared error at the FIR filter output is given by:- 

We can define the output signal to noise ratio for the filter as:- 

Assuming that the channel bandwidth in Hz is equal to the chip rate in chips per second then we can use the 
energy per bit E^, divided by the noise power spectral density A^o as a processing gain independent measure 
of the signal to noise ratio at the input to our FIR filter. In our case:- 

No Na- 

Figs. 2, 3, 4 and 5 show the theoretical and simulated performance of the Wiener filter calculated as 
above for a set of 7 and 3 1 chip Gold codes. Fig. 2 shows the theoretical and simulated performance of the 
7 chip Gold codes in terms of the output signal to noise ratio from the Wiener filter. Fig. 2. shows that the 
performance of the optimal filter with 4 users is practically the same as the performance of a matched filter 
with no MAI. Looking at the data, at 10 dB output signal to noise ratio, the difference between the 4 user 
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Fig. 2: Signal to noise ratio performance of the Wiener filter calculated for 7 chip Gold codes, 

curve and the matched filter curve is around 0.3 dB. From fig. 2, to have 7 users in a system with a process- 
ing gain of 7 we must have an increase of around 1.8 dB in E^/No to achieve the same performance as a 
matched filter with no MAI. These figures are again measured with 10 dB as the required output signal to 

noise ratio. _ , „ 

Fig. 3 shows theoretical and simulated performance of the 31 chip Gold codes in terms of the output 
signal to noise ratio from the Wiener filter. 
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Signal to Gaussian noise ratio Eb/No (dB) 
Fig. 3: Signal to noise ratio performance of the Wiener filter calculated for 31 chip Gold codes. 

With 16 users, we require around a 0.1 dB increase in E^/A^o over the matched filter with no MAI. 1 
better than the 4 user, 7 chip case, partly because 4/7 is greater than 16/31. When there are 31 users ii 
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chip sysiem, the required increase in Et^IN^ is 1 dB over the matched filter with no MAI. It would appear 
that the 31 chip system enjoys some inherent advantage over the 7 chip system, probably due to the greater 
degree of statistical orthogonality between the 31 chip codes when compared with the 7 chip case. How- 
ever, other factors such as the choice of the desired user or the choice of the code sets cannot be ruled out. 
Figs. 4 and 5 show the simulated bit error rate (BER) performance of the 7 and 31 chip codes. 
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Fig. 4: BER performance of the Wiener filter calculated for 7 chip Cold codes. 
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Fig, 5: BER performance of the Wiener filter calculated for 31 chip Gold codes. 

These graphs show similar results to figs. 2 and 3 in terms of the increase in Ei,/Nq required to accommo- 
date more users. A good approximation to figs. 4 and 5 can be derived from figs. 2 and 3 assuming that 
because of the central limit theorem the noise output from the Wiener filters is Gaussian. 
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Adaptive filter receivers 

In most cases the DS-CDMA channel is non-stationary and as we have already stated the MAI will 
vary with the birth and death of signals. Therefore the FIR filter used in our receiver has to be a time vary- 
ing approximation to the Wiener filter calculated using an adaptive algorithm. We will consider the proper- 
tie's of die two most popular adaptive algorithms, least mean squares (LMS) and recursive least squares 
(RLS) [1]. We shall use the 7 chip Gold code case as our example, as the 31 chip case would involve con- 
siderable extra computation. Fig. 6. shows the convergence properties of the LMS algorithm with two dif- 
ferent values of the adaption parameter /i. 
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Fig. 6: Convergence properties of the LMS and RLS adaptive filters 

The graphs show the feedback error ensemble summed over 100 independent trials. With = 0. 007 the 
algorithm converges to 20% of its inidai value within approximately 20 data bits, but with = 0. 0007 con- 
ve'rgence takes around 100 data bits. In a DS-CDMA cellular system, where the channel varies relatively 
quickly widi respect to the data rate of a single user, this slow convergence will not be acceptable. Fig. 7. 
shows the BER performance of the LMS algoridim after convergence with the same values of fi as for fig. 
6. It shows that the smaller value of ^ produces a good approximation to the Wiener filter, but the perfor- 
mance of the LMS algorithm with the larger value of ^ is around 2 dB worse in terms of the required signal 
to noise ratio for a bit error rate of 10"^ Thus it is difficult to find a value of fi which will provide a fast 
enough convergence without introducing a large residual error into the performance of the filter, even for 
the 7 chip case. The convergence time will be greater if the codes are longer. If we consider a DS-CDMA 
cellular system with imperfect power control and a multipath channel, then the eigenvalue spread of the 
input signal will be increased. This will also have an adverse effect on the convergence time of the LMS 
algorithm. More evidence of this is contained in [5]. Thus the LMS algorithm is unsuitable for this appli- 
cation. Also shown in Fig 6. is the convergence of an RLS algorithm with A = 1 and the diagonal elements 
of the autocorrelation matrix initially set to 0.001. This shows typical behaviour for an RLS algorithm, 
rapid convergence by 2N . The RLS curve in fig. 7 shows diat the RLS algoridim does not add any signifi- 
cant residual error to the Wiener filter solution. Thus the RLS algorithm is potentially more suited to a DS- 
CDMA cellular system. 
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Fig, 7: BER performance of the adaptive algorithms after 1000 iterations allowed for convergence, com- 
pared with the Wiener optimal and a matched filter with no MAI, 

Discussion 

In this section we shall look at the assumptions made in the derivation of the filters and look at apply- 
ing this type of receiver to a cellular system. 

We have assumed that all the channels are data bit and chip synchronous. If the channels are not chip 
synchronous, provided the receiver samples the desired channel synchronously, any change is likely to be 
beneficial as the effective MAI will be reduced. If the channels are not data bit synchronous however, the 
effect is gready detrimental. With transitions of the interfering channels occurring in the middle of the 
desired channel data bits, each interfering channel requires two eigenvectors in the Q matrix. Thus perfor- 
mance is reduced and the system breaks down when the number of users exceeds 0. 5^ instead of N for the 
data bit synchronous case. An interesting case is when the data bits are almost synchronous, for example as 
synchronous as the transmission delays in the system will allow. In this case, it may be possible to ensure 
that any transition occurs a small number of chips (compared with N) from the beginning or end of the 
desired user's data bit. In this case the detrimental effect may not be so great, although this hypothesis 
remains unproven. We have also assumed perfect power control. There is some evidence that this type of 
adaptive filter structure is tolerant to variations in the power of the received codes [6]. The last assumption 
we made is that the data bits are uncorrelated. Provided that the signals are independently generated, idle 
channels are suppressed and that care is taken over the content and timing of control information, this 
should be a reasonable assumption. 

If we are to apply this receiver structure to a cellular DS-CDMA system we need to take into account 
the non- stationary multipath channel and the birth and death of users. To take into account the multipath 
channel, we can use the technique in [7], replacing the impulse response of the spreading process C^n with 
Bmn^ where 5„„ is the convolution of the impulse response of the channel with the impulse response of the 
spreading process and repeat the analysis. This paper already describes the theoretical optimal with multi- 
path evaluated for all users of the system simultaneously. The non-stationary elements of the channel can 
be taken into account by blocks of training data with or without decision feedback in between. The buth 
and death of signals problem will be greatly alleviated if all users are constrained to switch on and off only 
immediately proceeding a training block. The convergence and tracking properties of the adaptive 
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algorithm and speed at which the channel varies will determine the length and frequency of the training 
blocks. However, by employing the RLS algorithm (or the covariance form of the least squares algorithm 
on training blocks [1]). the ratio of training data to information carrying data can be kept reasonable. 

Some desirable properties of this receiver are that it does not require to know the desired or the inter- 
ferers spreading codes, provided the uaining data is known and its adaptive nature will allow it to reduce 
the effect of strong interferers from neighbouring cells and local narrow band interference. 



Conclusions 

We have shown that an FIR filter can be used to separate a desired signal from MAI in a DS-CDMA 
system with only a small degradation in Gaussian noise performance. These filters can be approximated by 
trained adaptive filters. These sub-optimal filters may make practical receivers for a DS-CDMA cellular 
system. 



THE DECORRELATING RECEIVER 

The decorrelating detector is similar to the MMSE detector except that the noise term is not taken 
into account ie. the second term in equation {4} above is neglected in the formulae for 0^^ 

RADIAL BASIS FUNCTION RECEIVERS 

Introduction: In many DS-CDMA communications systems there are three sources of distortion 
when the signal arrives at the receiver, structured multiple access interference (MAI) from other users in the 
system, Gaussian noise which can often be extended to include unstructured interference and time disper- 
sion due to multipath propagation. The simplest DS-CDMA receiver structures are based on matched fil- 
ters for a non-dispersive channel and RAKE receivers for multipath channels. The MAI performance of 
matched filter/RAKE receivers can be enhanced by applying cancellation at the expense of increased 
receiver complexity [8] . Many proposed receiver structures are based on linear equaliser structures. 
Examples include decorrelating receivers which are based on the zero-forcing equaliser and those based on 
the MMSE equaliser [9] . We shall compare the above receiver strucnores with the RBF receiver. This non- 
linear receiver minimises the probability of error when deciding on a data bit. It has already been shown 
that an RBF equaliser has superior performance to a linear equaliser in multipath channels for conventional 
minimum bandwidth signalling[10] and that an RBF filter has superior performance to a MMSE filter in a 
non-dispersive CDMA system [II]. We shall show that the RBF based receivers have the ability to consid- 
erably increase the capacity of a DS-CDMA system when compared with other receiver structures. 
AWGN channel system model: To enable us to compare the wide variety of receivers discussed above we 
shall first consider an additive white Gaussian noise channel. Our system will consist of U independent 
users each transmitting a DS-CDMA signal which is chip and bit synchronous and with all users transmit- 
ting equal power normalised to 1. The data bit transmitted by user u during bit time k will be denoted by 

and die spreading code for user u by C^^^* We will use n to denote the chip number within die code 
which is an integer between 0 and {N-l) where N is the spreading sequence lengtii (processing gain). The 
spreading code set used throughout are 7 chips long and randomly generated. Without loss of generality we 
can assume that the desired user is user 0. The channel model will be a simple additive white Gaussian 
noise (AWGN) model with the noise time series denoted by G{kN + i). Thus the chip rate signal arriving at 
the receiver, denoted Y{kN + i). will be:- 

Y{kN^i)--^^D^{k)Cuj'rG{kN^i) 

at the point where data bit k chip / is received. In the AWGN case there is no need to consider / outside the 
range 0 < / < N as outside this time the signal will contain no useful information relating to data bit k. 
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AWGN K.,tannel receiver structures: We shall consider three main receiver structures, matched filtering, the 
MMSE receiver and the RBF receiver. In all cases we shall assume that the codes and the signal powers are 
known at the receiver. In the non-dispersive case the matched filter receiver is given by an N tap FIR filter 
whose co-efficients Hi are the spreading code for the desired receiver i.e. 

- ^0.1 

The MMSE receiver is also an ;V tap FIR filter, but the co-efficients of this filler are given (in vector form) 
by[12] :- 

is the autocorrelation matrix of the cyclostationary input signal with dimensions NxN. O^j^ is the 
cross correlation vector. The RBF receiver is a nonlinear filter whose estimate of the data output is given 
by:- 



^0 = ^8^ S exp 



=1 



where y(k) denotes the input vector consisting of the N chip spaced input samples at data bit time k and c] 
are the centres. The centres are the noise free input vectors for all possible input data bit combinations. 
There are therefore = 2^ centres. \.\ denotes the length of the enclosed vector (Euclidean norm). <xis the 
standard deviation of the noise, is the value of Dq associated with centre C/. In our simulations we shall 
assume that the centres and weights are known at the receiver. For all three receiver cases we shall also 
consider supplementing the receiver with a single stage of parallel cancellation similar to [8] . 

AWGN channel simulation results: The receiver structures above were simulated using Monte-Carlo simu- 
lation. The graphs show the logjo of the BER averaged over all active users in the system plotted against 
the number of active users. Fig. 8 shows results for E^^/Nq = 9 dB. These figures show that the perfor- 
mance of matched filtering becomes poor as the MAI (the number of active users) increases. The number 
of active users which gives an acceptable BER for a matched filter receiver improves considerably with the 
addition of cancellation, as this filter is clearly interference limited. The MMSE receiver performs better 
than both the matched filter and the matched filter with cancellation. It too improves with the addition of 
cancellation, although the improvement is not as great as adding cancellation to a matched filter. The RBF 
receiver is considerably better than both the matched filter and the MMSE receiver, with or without cancel- 
lation. However, the RBF receiver is not improved by the addition of cancellation. This is because in this 
scenario the RBF is the maximum likelihood receiver and therefore cancellation cannot improve its perfor- 
mance. 

Multipath channel system model: We shall now consider a stationary multipath channel model. The chan- 
nel we shall consider has impulse response:- 

M(z) = 0. 3482 + 0. 8704c"^ + 0. 34822"^ 

All signals are assumed to pass through the same channel, as would be the case in the downlink of a mobile 
radio system. The received signal at the time data bit k, chip i is received by the direct path therefore 
becomes :- 

Y{kN + 0 = 0. 3482f 2* D^Cua + ^(^^ + 0 ] + 

0. 8704| I>„(^)C„j-i + G{kN + i - 1)1 + 0. 1>AZ\^ D„(^)C„.,_2 + G{kN + / - 2) ) 
Vu=o y v«=o / 

Note that if / - ;c is negative then the D„(/:)Q,,-.^ is replaced by D^{k - l)Q.^v+/-jc and if i or i - x is greater 




than N (if we extend the receiver filter length beyond A^) then DMCuj-x is replaced by 
D^,(k + l)Cui-N'X- I'^^s is a direct illustration of ISI caused by the multipath channel. 
Multipath channel receiver structures: We shall base all our new receiver structures on a new set of input 
signal samples y{kN + i) where the range of i is extended to 0 < / < N + 2 to capture all the signal energy 
that originated from data bit k. This combined with the multipath channel will result in ISI from both the 
preceding and following data bit. The equivalent of the matched filter for a multipath channel is an FIR fil- 
ter with A/ +2 taps where the laps are given by:- 

The * represents convolution and Cq and M represent the spreading code of user 0 and impulse response of 
the channel respectively. The MMSE receiver coefficients are given by the same equation as before, how- 
ever, the elements of the (A^ + 2)x(A/ + 2)) autocorrelation matrix ^ and the {N + 2) cross correlation vector 
must be calculated according to die derivation of the MMSE receiver taking into account the multipath. 
The RBF receiver equation is also essentially unchanged, except now the input and centre vectors are of 
length N + 2 and the number of centres is now 2^^ as all possible combinations of the previous, current and 
next data bit must be considered. This number of centres increases rapidly with the number of active users 
U and the computational load will rapidly become impractical. 

Multipath channel simulation results: Graphs for the BER performance of the receiver strucmres described 
in the previous section are shown in fig. 9 for Ei,/Nq = 9 dB. These graphs show similar trends to the non- 
dispersive case. The RAKE receiver rapidly breaks down as the MAI interference increases because the 
interfering users are not taken account of at all in the RAKE receiver and are therefore treated as unstruc- 
tured noise. The MMSE receiver does considerably better than the RAKE receiver. The RJBF receiver per- 
forms better than either the RAKE receiver or the MMSE receiver. However, it does involve a considerable 
increase in computational complexity. The RBF results are truncated at 5 users because the calculation of 6 
users would require evaluadng the Euclidean distance from 2*^ centres for every data bit sent. 
Discussion and Conclusions: We have shown that an RBF receiver has gready improved performance over 
the more conventional matched filter and MMSE based receiver strucmres. This performance increase is 
apparent in both AWGN and multipath channels. However the computational complexity and the number 
of variable parameters in a non- stationary environment presently make the RBF filter receiver impractical 
for mobile radio applications. 
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Fig. 8: BER vs number of active users for E^/Nq = 9 dB in an AWGN channel. 
All users are equal power and the spreading code length N is 7, 
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Fig, 9: BER vs number of active users for Ei,/Nq - 9 dB in a stationary multipath channel 
All users are equal power and the spreading code length N is 7. 



NEAR OPTIMUM RECEIVERS 

Near Optimum receivers use similar methods to the Radial Basis function receiver, usually with sim- 
plifications to the Radial Basis Function by dropping the exponential term and simplifying the Euclidean 
difference expression. These receivers can be implemented at the chip level or the bit level (after pre- 
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proc-ssing) and often involve the Viterbi forward programming algorithm to search for the optimum centre 
(path in the Viterbi algorithm). 

CAiNCELLATION BASED RECEIVERS 

The optimal receiver for a direct sequence code division multiple access (DS-CDMA) system is the 
maximum likelihood sequence estimator[13][14] which is effectively a RBF receiver with infinite memory. 
However, this receiver's complexity rises exponentially with the number of users and is therefore impracti- 
cal. At the other end of the complexity scale, a matched filter is commonly used as the receiver in a DS- 
CDMA system. A matched filter is the optimal receiver only in additive white Gaussian noise (AWGN) 
and has very poor performance when the level of multiple access interference (MAI) from other users in the 
system is high. 

Between these two extremes a range of sub-optimal receiver structures have been proposed. The 
majority of these can be split into two types. Cancellation structures are typically based on matched filter- 
ing followed by subtraction of the matched filter output from a delayed version of the input sig- 
nd[8][15][16] . Equaliser structures use a linear or decision feedback FIR filter with the co-efficients opti- 
mised according to varying criterion such as minimum mean square error (MMSE) or zero-forcing 
[5][7][i7] . A comprehensive review of this area and a reference list can be found in[9] . 

In this section the performance of a Wiener filter receiver is compared with that of a simple parallel 
canceller. The combination of the two techniques is also examined. 

System description 

The system to be considered consists of U independent equal power users transmitting both data bit 
and chip synchronously as shown in fig. 10. The data bit transmitted by user u at bit Ume k will be denoted 
by DM and'the sprracling code for user M-will be denoted-by C„^. n denotes the chip within the code 
which is an integer between 0 and {N-\) and N is the spreading sequence length (processing gain). The 
spreading codes used as an example throughout this paper are 64 chips long and randomly generated. 
Orthogonal (Walsh) or semi-orthogonal (Gold) codes would give better performance, but would require 
longer simulation umes to give meaningful BER results. In many communications systems the channel 
will destroy the orthogonality property of orthogonal codes anyway. Throughout this paper the chip and 
data values will be normalised to ±i: Without loss of generality we can assume that the desired user is user 
0. The channel under consideration will be a simple AWGN channel with the noise denoted by G{kN + n) 
having variance a". Thus the chip rate signal arriving at the receiver, denoted YikN + /i), will be:- 

Y{kN + n) = 2^ DMC^,n + G{kN + n) 



Receiver structures and theoretical performance. 

The four receiver structures shown in fig. 11 will be examined. Receiver structure a) is the simple 
matched filter case. The receiver consists of an N tap FIR filter whose co-efficients h„ are a scaled version 
of the desired users code, ie. 



N 



The matched filter treats the MAI as noise and therefore, if the signal power of each user is one and die 
codes are random, the central limit theorem allows us to approximate the BER for this receiver as:- 



BER„y = erfd 



/ N 



1) 



(1) 



where erfcQ is the complimentary error function. 
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Fig. 10: The construction of the received signal Y(n) 



Receiver structure b) is a simple parallel canceller using matched filters to despread each interfering 
signal and reconstruct a replica which is subtracted from an appropriately delayed input signal, cancelling 
much of the interference. The remaining signal is then despread using a matched filter to the desired user. 
The reconstruction of the signal assumes that the power of the signal is known exactly. Therefore an inter- 
fering signal which is is received with the correct sign will be cancelled exactly and an interfering signal 
received incorrectly will have its amplitude doubled (power quadrupled). A lower bound on the BER for 
this structure is given by:- 



BER = erfc 



(V 



N 



a'-^BER^f(4,0)iU'l) 



(2) 



This equation is only a lower bound for two reasons. Firstly the noise sarhples for each matched filter in 
the cancellation stage are the same, where as the above equation assumes they are independent samples. 
Secondly, after cancellation the combined interference plus noise distribution of the remaining signal is a 
poor approximation to Gaussian because it consists of die Gaussian noise and a limited number of enlarged 
interferering signals which will be a poor approximation to a Gaussian distribution. 

Receiver structure c) is the Wiener, Levinson or MMSE producing filter [2][1] . It is also an N tap 
FIR filter, but the co-efficients of this filter are given (in vector form) by[12] :- 



= <I>"^<1> 



where 




c) Wiener filter d) Wiener filter based cancellation 

Fig, 11: The four receiver structures considered 



The T superscript denotes a matrix transpose. <^yy is the autocorrelation matrix of the input sig] 
dimensions N^N, given by:- 




The matrix Q has dimension NxU and has the codes as its columns, ie:- 



2 = 





^1,0 








Ci.i 


C2,l . 




^0.2 




^2,2 













1 is the NxN identity matrix. <^y^ is the cross correlation vector:- 

^yx = [Q,o» Qj» Q,2 ' * • Q,iV-l] 

The MMSE is given by:- 

MMSE = 1 - h^^y^ 
and the BER performance (derivation A) is:- 
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BERwiencr = CffC 



-(^^^y:r)^ + (2- 0)A^O„;, - L 0 + MMSE 



(3) 



To obtain average BER performance, this equation must be averaged over all the users in the system to give 

B ER Wiener- 

Receiver structure d) is a parallel canceller using Wiener filters for the initial estimate of the interfer- 
ers data bits. By analogy with equation 2, a lower bound on the BER performance of this structure is given 
by:- 



BER = erfc 



f / _ ^ 

\ 0-2 + BERwUneM- 



0)(Z/-1) 



(4) 



Results 

All simulated results in this section are for 60,000 data bits, with the BER averaged over all users. 
Graphs show the average BER for all users in the system plotted against number of users. The sequence 
length is 64. The signal to Gaussian noise ratio is expressed as:- 

Fig. 12 shows a comparison between the theoretical results derived in above and simulation results. 
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Fig, 12: Theoretical and simulated results for Ei,/Nq = 7dB 



The results show that equations (i) and (3) give good and unbiased estimates of the simulated results for 
the matched filter and Wiener filter respectively. The deviation of the simulated results for the matched fil- 
ter from equation (1) with a low number of users is caused by the interference distribution only being 
approximately Gaussian and the limited number of simulated points. Equations (2) and (4) however only 
give an approximation to the actual performance of the two cancellation receivers. This estimate is biased 
towards a lower BER for reasons that were discussed in above. 




Fig. 13 shows simulated results for a range of signal to noise ratios. These results show that a 
matched filter is only a good receiver structure when Gaussian noise is the dominant source of interference. 
This only applies when there are very few interfering users and the signal to noise ratio is low. 

A single stage of parallel cancellation based on matched filters performs better than a matched filter 
except at extremely high levels of interference, where the BER is too low for most conmiunications systems 
anyway. The improvement is fairly significant, with Et,/No=9dB, the cancellation receiver can support 
three times as many users as the matched filter if the required BER is 10"". 

The Wiener filter will always perform better than the matched filter. The only case where the perfor- 
mance of diese two receiver systems is the same is when there is no MAI interference and the Wiener filter 
becomes a scaled version of the matched filter. The Wiener filter also performs significantly better than the 
matched filter based cancellation receiver. With Et,/NQ=9dB and a required BER of 10"^, the Wiener filter 
will allow approximately six times as many users as a matched filter and twice as many as the matched fil- 
ter based cancellation receiver. The computational complexity of the Wiener filter receiver is identical to 
the matched filter receiver, and significantly less than the matched filter based cancellation receiver. This 
assumes that the Wiener co-efficients are calculated in advance. 

The Wiener filter based cancellation receiver gives the best performance of all. It will allow approxi- 
mately seven times the number of users if E^/^o = 9dB and the required BER is 10"^. compared with the 
matched filter receiver. 

Note that in all cases there is a significant variation in performance between users. For example, 
using the Wiener filter receiver with 48 users and Ei,/Nq = 9dB, the average BER is 0.00940 but the user 
with the best spreading sequence experiences a BER of 0.001167 and the user with the worst spreading 
sequence experiences a BER of 0.038092. This variation is due to the cross correlation properties of the 
code set and will also apply to an orthogonal code set which has lost its orthogonality due to a multipath 
channel. 



Discussion and Conclusions 

In this paper we have shown that cancellation and Wiener filtering both perform much better in MAI 
than a simple DMF. Wiener filters in particular offer a large increase in the number of users that can be 
accommodated. The two ideas can be successfully combined and provide a better performance than either 
on its own. 

There is a cost to pay for improved performance. The cancellation receiver introduces a delay of one 
data bit and requires a substantial increase in either computation or hardware. Both the Wiener filter and 
the cancellation receiver require knowledge of the number and spreading sequences of the interfering users 
at the receiver, and the Wiener filter also requires a priori knowledge of die signal to noise ratio. If the sig- 
nal to noise ratio is very high, calculation of the inverse of the autocorrelation matrix for the Wiener filter 
can be difficult as it becomes close to singular. In a computer network application the calculation of the 
Wiener filter in advance may be feasible, but in a mobile communications application the signal to noise 
ratio and the number of users varies rapidly with time. However, a good approximation to the Wiener filter 
can often be obtained using an adaptive algorithm. 

Derivation A 

To calculate the BER for the Wiener filter we require to find a relationship between the MMSE and 
the variance at the output of the filter. 

MMSE = E[{Do(k) - xf] = E[Dl(k)] - (2. 0)E[Do(k)x] + E[x-] 

where £[] denotes the expected value and -r denotes the filter output. DoCk) takes the values ±1, therefore:- 

MMSE = 1 - (2. 0)h'^<Py:c + £1^^] f^-^] 
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Fig. 13: Simulated BER averaged over all users plotted against the number of users. Graphs for four dif- 
ferent signal to additive Gaussian noise ratios ( Ei^/Nq) are shown. Each point on the graphs is the result of 

60,000 data bits per user. The sequence length N is 64. 
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The variance of the filter output is given by:- 

var(x) = E[(x - xf] = E[x-] - (2. 0)E[xx] + E[x'] 

where Jc denotes £[^]. 

varix) = ib'^^^f - (2. 0){h^<py:cf + ^^[^^1 
Subtracting [A.l] from [A.2] and rearranging gives:- 

var(Jc) = - (h^^yj.f + (2. O)^^*^,^, - l.O-^MMSE 
Therefore, assuming the output of the Wiener filter has a Gaussian distribution, the BER is given by 

r 



BER 



Wiener 



= erfc 



var{x) 



= erfc 



'(b^yx)'^ + (2. 0)A^O„ -1.0+ MMSE 



[A.2] 



VOLTERRA RECEIVER 

The Yolterra receiver is made up by a power series expansion of the received signal followed by a lin- 
ear filter such as MMSE where they Volterra expanded signal v is used instead of the original received sig- 
nal y. For a DS-CDMA system with received input signal y(kN + n). the output of the third order Volterra 
expansion is given by:- 

D^sgn^ hi(a)y(kN + n - a) + X 2 h2(a,b)y{kN + n - a)y(kN + n-'b) 
+ X Z X A3 (a. b, c)y{kN + n- a)y{kN + n- b)y(kN + n-c) 

0=0 c==0 

where the h co-efficients are estimated using a linear technique such as MMSE with the autocorrelation and 
crosscorrelation matrices replaced with the autocorrelation and crosscorrelation matrices of the power 
series expanded matrices. 
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